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l/y Abstract 

Principal angles between subspaces (PABS) (also called canonical angles) 
serve as a classical tool in mathematics, statistics, and applications, e.g., data 
mining. Traditionally, PABS are introduced and used via their cosines. The 
£h tangents of PABS have attracted relatively less attention, but are important 

^ for analysis of convergence of subspace iterations for eigenvalue problems. 

We explicitly construct matrices, such that their singular values are equal 
to the tangents of PABS, using several approaches: orthonormal and non- 
orthonormal bases for subspaces, and orthogonal projectors. 
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i—* troduced by Jordan pQ in 1875. Hotelling [2] defines PABS in the form of 

>• canonical correlations in statistics in 1936. Numerous researchers work on 

PABS; see, e.g., our pseudo-random choice of initial references [3J IH El El El IE] , 
out of tens of thousand of Internet links for the keywords "principal angles" 
and "canonical angles". 
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In this paper, we first briefly review the concept and some important 
properties of PABS in Section [2} Traditionally, PABS are introduced and 
used via their sines and more commonly, because of the connection to canon- 
ical correlations, cosines. The tangents of PABS have attracted relatively 
less attention, despite of the celebrated work of Davis and Kahan [5], which 
includes several tangent-related theorems. Our interest to the tangents of 
PABS is motivated by their applications in theoretical analysis of conver- 
gence of subspace iterations for eigenvalue problems. 

We review some previous work on the tangents of PABS in Section [3j The 
main goal of this work is explicitly constructing a family of matrices such 
that their singular values are equal to the tangents of PABS. We form these 
matrices using several different approaches: orthonormal bases for subspaces 
in Section [4], non-orthonormal bases in Section [5j and orthogonal projectors 
in Section [6] 

2. Definition and basic properties of PABS 

In this section, we remind the reader the concept of PABS and some 
fundamental properties of PABS. First, we recall that an acute angle between 
two unit vectors x and y, i.e., with x H x = y H y = 1, is defined as cos#(x, y) = 
\x H y\, where < 9(x,y) < ir/2. 

The definition of an acute angle between two vectors can be recursively 
extended to PABS; see, e.g., PHUICEI]. 

Definition 2.1. Let X C C n and y C C n be subspaces with dim(X) = p 
and dim(3^) = q. Let m = min (p, q). The principal angles 

®{X, y) = [0!, ...,9 m ], where 9 k e [0, tt/2], k = 1, . . . ,m, 

between X and y are recursively defined by 

Sk = cos(9k) = max max \x H y\ = \x h yu\, 
x&x y&y 

subject to 

\\x\\ = \\y\\ = 1, x H Xi = 0, y H yi = 0, i = l,...,k-l. 
The vectors {x\, . . . , x m } and {yi, . . . , y m } are called the principal vectors. 

An alternative definition of PABS, from [TUl [EE], is based on the singular 
value decomposition (SVD) and reproduced here as the following theorem. 
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Theorem 2.1. Let the columns of matrices X G C nxp and Y G C nx<? form 
orthonormal bases for the subspaces X and y, correspondingly. Let the SVD 
of X H Y be UT,V H , where U and V are unitary matrices and £ is a p x q 
diagonal matrix with the real diagonal elements si(X H Y), . . . , s m (X H Y) in 
nonincreasing order with m = min(p, q). Then 

cosQ\X,y) = S {X H Y) = [ Sl (X H Y) ,...,s m {X H Y)] , 

where Q^(X,y) denotes the vector of principal angles between X and y ar- 
ranged in nondecreasing order and S(A) denotes the vector of singular values 
of A. Moreover, the principal vectors associated with this pair of subspaces 
are given by the first m columns of XU and YV, correspondingly. 



Theorem 2.1 implies that PABS are symmetric, i.e. Q(X,y) = 0(y,X), 
and unitarily invariant, i.e., Q(UX ,Uy) = Q(X,y) for any unitary matrix 
U G C nxn , since (UX) H (UY) = X H Y. A number of other important prop- 
erties of PABS have been established, for finite dimensional subspaces, e.g., 
in jH O El [7J [8] , and for infinite dimensional subspaces in [31 [12] . We list a 
few most useful properties of PABS below. 



Property 2.1. [7] In the notation of Theorem 2.1, let p > q and let [X,X±] 
be a unitary matrix such that S(X^Y) = [si (X± Y ),..., s q (X±Y)} where 
si{X?Y) > > s q {X?Y). Then s k {XfY) = sin(0 9+1 _ fc ), k = 1, . . . ,q. 

Relationships of principal angles between X and y, and between their or- 
thogonal complements are thoroughly investigated in [H El [12] . The nonzero 
principal angles between X and y are the same as those between X 1 - and 
y L . Similarly, the nonzero principal angles between X and y x are the same 
as those between X x and 3^ 

Property 2.2. [HEMI2] Let the orthogonal complements of the subspaces X 
and y be denoted by X L and y L , correspondingly. Then, 

1. [e+(*,;y),o,...,o] = [e^,^),o,...,o" . 

2. [e^,^),o,...,o] = [e^,^),o,...,o . 

3. [f , . . . , f , &(x, y)] = [f - et(Ar, y- 1 ), o, . . . , o] , 

where there are max(dim(A') — dim(3^),0) additional 7r/2s on the left. 

Extra 0s at the end may need to be added on either side to match the sizes. 
Symbols 4- and t denote the components in vector arranged in nonincreasing 
and nondecreasing order, correspondingly. 

These statements are widely used to discover new properties of PABS. 



3 



3. Tangents of PABS 



The tangents of PABS serve as an important tool in numerical matrix 
analysis. For example, in [131 E], the authors use the tangent of the largest 
principal angle derived from a norm of a specific matrix. In [6, p. 231-232] 
and [151 Theorem 2.4, p. 252] the tangents of PABS, related to singular values 
of a matrix — without an explicit matrix formulation — are used to analyze 
perturbations of invariant subspaces. In [6j [131 E2 EES], the two subspaces 
have the same dimensions. 

Let the orthonormal columns of matrices X, Xj_, and Y span the sub- 
spaces X, the orthogonal complement X L of X, and y, correspondingly. 
According to Theorem 2.1 and Property 2.1, cos 0(^,3^) = S(X Y) and 
smQ(X,y) = S(X±Y). The properties of sines and especially cosines of 
PABS are well studied; e.g., in [71 [10]. One could obtain the tangents of 
PABS by using the sines and the cosines of PABS directly, however, in some 
situations this does not reveal enough information about their properties. 

In this work, we construct a family J 7 of explicitly given matrices, such 
that the singular values of the matrix T are the tangents of PABS, where 
T G J 7 . We form T in three different ways. First, we derive T as the product 
of matrices whose columns form the orthonormal bases of subspaces. Second, 
we present T as the product of matrices with non-ort honor mal columns. 
Third, we form T as the product of orthogonal projections on subspaces. 
To better understand the matrix T, we provide a geometric interpretation of 
singular values of T and of the action of T as a linear operator. 

Our constructions include the matrices used in [51 H31 HH EE] • Further- 
more, we consider the tangents of principal angles between two subspaces 
not only with the same dimensions, but also with different dimensions. 



4. tan © in terms of the orthonormal bases of subspaces 

The sines and cosines of PABS are obtained from the singular values 
of explicitly given matrices, which motivates us to construct T as follows: 
if matrices X and Y have orthonormal columns and X H Y has full rank, 
let T = XfY (X H Y)~\ then tun Q(X,y) could be equal to the positive 
singular values of T. We begin with an example in two dimensions in Figure [T] 

LetX=f COS fV X ± =f- Sin ^, and Y=(^M), 
\ snip J \ cosp J \ sin (0 + p) J 
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Figure 1: The PABS in 2D. 

where < 9 < tt/2 and (3 G [0, tt/2]. Then, X H Y = cos 9 and XfY = sin 9. 
Obviously, tan 9 coincides with the singular value of T = X±Y (X H Y) . 
If 9 = tt/2, then the matrix X H Y is singular, which demonstrates that, in 
general, we need to use its Moore-Penrose Pseudoinverse to form our matrix 
T = XfY [X H Y)\ 

The Moore-Penrose Pseudoinverse is well known. For a matrix A G C nxm , 
its Moore-Penrose Pseudoinverse is the unique matrix A' G C mxn satisfying 

AA^A = A, A^AA^ = A\ [AA^) H = AA\ {A^A) H = A^A. 

Some properties of the Moore-Penrose Pseudoinverse are listed as follows. 

• If A = UY1V H is the SVD of A, then A* = V^U H . 

• If A has full column rank and B has full row rank, then (ABY = B^AK 
However, this formula does not hold in general. 

• AA< is the orthogonal projector onto the range of A, and A' A is the 
orthogonal projector onto the range of A H . 

For additional properties of the Moore-Penrose Pseudoinverse, we refer the 
reader to |B|. Now we are ready to prove that the intuition discussed above, 
suggesting to try T = X±Y ( K X H Y^j\ is correct. Moreover, we cover the case 
of the subspaces X and y having possibly different dimensions. 
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Theorem 4.1. Let X e c nx P have orthonormal columns and be arbitrarily 
completed to a unitary matrix [X,X±\. Let the matrix Y e c nxg &e sitc/i 
i/iat = /. Then the positive singular values, denoted by S + (T), of the 

matrix T = XfY {X H Y) ] satisfy 

ttme(K(X),K(Y)) = [oo,...,oo,,S + (T),0,...,0], 

where IZ(-) denotes the matrix range. 



Proof. Let [ Y Y± ] be unitary, then [X X± ] H [Y Y± 
such that 



is unitary, 







n — q 


p 


' X H Y 


X H Y ± 


n — p 


[X»Y 


X»Y ± 



[ X X ± } H [ Y Y ± 



Applying the CS-decomposition (CSD), e.g., [T6l HZl HE] , we get 



[ X X x } H [ Y Y ± 



X H Y 


X H Y ± ' 






D 


XfY 


X»Y ± 






-p),V 1 eU(q), 


and V 2 ElA{n 


-q) 



Vo 



U(k) denotes the family of unitary matrices of the size k. The matrix D has 
the following structure: 



r s q—r—s n—p—q+r s p—i — s 



D 



r 
s 

p—r—s 


I 


c 


o c 


H 

^ s 


s 

I 


n—p—q+r 


o s 






-I 




s 




s 






-c 


q—r—s 






I 




Of . 



where C = diag(cos(#i), . . . , cos(0 s )), and S = diag(sin(6'i), . . . , sin(6 l s )) with 
6k G (0, 7r/2) for k = 1, . . . , s, which are the principal angles between the 
subspaces 7Z(Y) and 1Z(X). The matrices C and S could be not present. 
The matrices O s and O c are matrices of zeros and do not have to be square. 
/ denotes the identity matrix. We may have different sizes of / in D. 



6 



Therefore, 



XfY {X H Y) ] = U 2 



" o s 




" / 


s 


V»Vi 


C 


I 




o c 



Uo 



sc- 



H 

c 



u*. 



Hence, S+(T) = (tan(^), . . . ,tan(0 s )), where < 9 1 < 9 2 < ■ ■ ■ < 9 S < vr/2. 

From the matrix D, we obtain S (X H Y) = S (diag(J, C, O c )) . According 
to Theorem gl] e(K(X),K(Y)) = [0, . . . , 0, l5 . . . , 9 S , ir/2, . . . , tt/2], where 
£ (0, vr/2) for = 1, . . . , s, and there are min(g — r — s,p — r — s) additional 
7r/2's and r additional 0's on the right-hand side, which completes the proof. 

□ 

Remark 4.2. If more information is available about the subspaces X , y and 
their orthogonal complements, we can obtain the exact sizes of block matrices 
in the matrix D. Let us consider the decomposition of the space C n into an 
orthogonal sum of five subspaces as in 



c n = Ti 00 © an™ e an 



01 



1-10 



Win © OJl, 



where <m 00 = x n y, m 01 = x n y 1 , SDTio = ^03^, ajt u = at- 1 n y J 

(X = TZ (X) and y = 11 (Y)). Using Tables 1 and 2 in [12] we get 



dim(OJtoo) 
dim(mtio) 
dim(OJtn) 
dim(SWoi) 



r, 

q- 

n 
p- 



r - 
P 



g + r, 
s, 



w/jere SOT = 0Jt 



A' 



071 



y 



yyty± with 



m x =x n (OJtoo © SDToi) 
aji^±=Af ± n(ajiio©ajiii) 
0K y =y n(9H o©mt 10 ) 
ajiy±=y- L n(ajioi ©ajtu) 
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and s = dim(DJlx) = dim(Wly) = dim(9Jt Y - L ) = dim(9Jt;y±). Thus, we can 
represent D in the following block form 



dim(aJtoo) dim(9Jt)/2 dim(OT 10 ) dim(ajtu) dim(OT)/2 dim(OT i) 



dim(OToo) 
dim(2K)/2 
dim(OToi) 


/ 


c 


o c 


of 


s 


I 


dim(SHn) 
dim(9Jt)/2 
dim(OTio) 


O s 


s 


I 


-/ 


-c 


H 

c 



In addition, it is possible to permute the first q columns or the last n — q 
columns of D, or the first p rows or the last n — p rows and to change the 
sign of any column or row to obtain the variants of the CSD. 



Remark 4.3. From Remark \4-£\ we see that some identity and zero matrices 
may be not present in the matrix D. To obtain more details for D, we have to 
deal with D in several cases based on either p < q or q > p and either p+q < n 
or p + q > n. Since the angles are symmetric such that Q(X, y) = Q(y, X), 
it is enough to discuss one case of either p < q or q < p. Let us assume 
q < p which implies dim(9Jtoi) — dim(9Jtio) = p — q. 

Case 1: assuming p + q < n, we have (also see [HI [20] ), 



D 



Co 





So 





0(p- q )xq 








Ip-q 


—So 





Co 





(n—p—q) x q 











where Co = diag(cos(#i), . . . , cos(9 q )) and So = diag(sin(#jj 
8k £ [0, 7r/2] for k = I, . . . ,q. 

Case 2 : assuming p + q > n, we have 



, sin(0 g )) with 



D 



O 



Co 

0(p+q—n)x(n—p) Ip+q—n 



(n—p)x(p+q-n) 



o 



(p-q)x(n-p) 



o 



(p-q)x(p+q-n) 



-So 



O 



(n—p)x(p+q—n) 



So O 

o o 

O L 



p-q 



Co O 



where C = diag(cos(6 l 1 ), . . . , cos(6 ) n _ p )) and S = diag(sin(#ij 
with 9k G [0, 7r/2] for k = 1, . . . , n — p. 



,sin(0„_ p )) 
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Due to the fact that the angles are symmetric, i.e., Q(X,y) = Q(y,X), 



the matrix T in Theorem 4.1 could be substituted with Y±X(Y H Xy. More- 
over, the nonzero angles between the subspaces X and y are the same as 
those between the subspaces X 1 - and y L . Hence, T can be presented as 
X H Y ± (X^Y ± y and Y H X ± (YfX ± y. Furthermore, for any matrix T, we 
have S(T) = S(T H ), which implies that all conjugate transposes of T are 
valid. Let J 7 denote a family of matrices, such that the singular values of the 
matrix in T are the tangents of PABS. To sum up, any of the formulas for 
T in T in the first column of Table Q] can be used in Theorem 14.11 



Table 1: Different matrices in T: matrix T G T using ortho-normal bases. 



XfY (X H YY 


P X ±Y (X h yY 


Yfx(Y M xy 


Py±x{Y H xy 


X H Y ± (X»Y ± y 


P x Y ± (X£Y ± y 


Y H X ± (Y£X ± y 


PyX ± (Y»x ± y 


(Y H xyY H x ± 


(Y H XyY H P x ± 


[x H Yyx H Y ± 


(x H Yyx H p y ± 


(YfX ± yYfX 


(YfX ± yYfP x 


(X?Y ± yx£Y 


(x«Y ± yx«Py 



Using the fact that the singular values are invariant under unitary multi- 
plications, we can also use P x ±Y(X H Yy for T in Theorem 4.1, where P x ± 
is an orthogonal projector onto the subspace X ± . Thus, we can use T as in 
the second column in Table [TJ where P x , Py, and Py± denote the orthogonal 
projectors onto the subspace X, y, and y L , correspondingly. 



Remark 4.4. From the proof of Theorem 4-P we immediately derive that 



XfY (X H Y) j = -(YfX ± yYfX, and YfX(Y H Xy = —(XfY±yX±Y. 

Remark 4.5. Let XfY (X H Y) j = U 2 ^iU 1 and YfX(Y H Xy = V 2 ^ 2 V X , be 
the SVDs, where Ui and Vi (i — 1,2) are unitary matrices and Ej (i = 1,2) 
are diagonal matrices. Let the diagonal elements of Si and S 2 be in the same 
order. Then the principal vectors associated with the pair of subspaces X and 
y are given by the columns of XU\ and YV± . 



5. tan © in terms of the bases of subspaces 

In the previous section, we use matrices X and Y with orthonormal 
columns to formulate T. It turns out that we can choose a non-orthonormal 
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basis for one of the subspaces X or y. 

Theorem 5.1. Let [X, X±] be a unitary matrix with X e C nxp . Let 
Y G C nX9 and rank (Y) = rank \X H Y) < p. Then the positive singu- 
lar values of the matrix T = XfY (X H Y) j satisfy ttmQ(ll{X),K(Y)) = 

s.(T)A) o]. 

Proof. Let the rank of F be r, thus r < p. Let the SVD of Y be U"EV H , 
where U is an n x n unitary matrix; £ is an n x q real diagonal matrix with 
diagonal entries s\, . . . , s r , 0, . . . , ordered by decreasing magnitude such that 
Si > S2 > ■ ■ ■ > s r > 0, and V is a qxq unitary matrix. Since rank(F)=r < q, 
we can compute a reduced SVD such that Y = U r T: r V r . Only the r column 
vectors of U and the r row vectors of V H , corresponding to nonzero singular 
values are used, which means that S r is an r x r invertible diagonal matrix. 
Based on the fact that the left singular vectors corresponding to the non-zero 
singular values of Y span the range of Y, we have 

tan e(K(X), TOY)) = tan G(K(X), K(U r )). 

Since ia.nk(X H Y) = r, we have rank(X H ?7 r S r ) = rank(X H U r ) = r. Let 



T\ = X±U r iyX H Ur) . According to Theorem 



4.1 



it follows that 



ta,nQ(K{X),K(U r )) = [S+iTt), 0, . . . , 0]. 

It is worth noting that the angles between TZ(X) and lZ(U r ) are in [0,7r/2), 
since X U r is full rank. 

Our task is now to show that T\ = T. By direct computation, we have 

T = XfY (X H Y) ] 

= xf u r KV r H (x H u r z r v r H y 

= xfu r E r v r H (v r n y (x H u r z r y 

= XfU r ^ r (X H U r ^ r ) ] 

= XfU r ^ r Ysl(X H U r ) ] 

= xfu r (x H u r y. 

In two identities above we use the fact that if a matrix A is of full col- 
umn rank, and a matrix B is of full row rank, then (ABy = B^A* 1 . Hence, 
ta,nQ(n(X),Tl(Y)) = [S+(T), 0, . . . , 0] which completes the proof. □ 
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Remark 5.2. Let X H Y have full rank. Let us define Z = X+Xj_T following 
p.231-232] and [15, Theorem 2.4, p.252]. Since (X H Y) ] X h Y = I, the 
following identities Y = P X Y + P x xY = XX H Y + X ± XfY = ZX H Y imply 
that TZ(Y) C 1Z(Z). By direct calculation, we obtain that 



X H Z = X H (X + X ± T) = I and Z H Z 



{X + X ± T) H {X + X ± T) 



I + T H T. 



Thus, X H Z {Z H Z) — (I + T H T)~ 1 / 2 is Hermitian positive definite. 
The matrix Z(Z H Z)~ 1 / 2 by construction has orthonormal columns which 
span the space Z . Moreover, we observe that 

& {X H Z(Z H Zy l l 2 ) = At ((/ + T H T)- 1 ' 2 ) = {l + S 2 (T))' 1/2 . 

Therefore, tan Q(TZ(X), K(Z)) = [S+(T), 0, . . . , 0] and dim (X) = dim(Z). 



From Theorem \5J\ we have tan&(K(X),K(Y)) C taaQ(K(X), K(Z)). It 
is worth noting that Q{Tl{X) ,K{Y)) and G(K(X),K(Z)) in (0,vr/2) are the 
same. For the case p = q, we have that Q(1Z(X),1Z(Y)) = Q(TZ(X),TZ(Z)). 
This gives us an explicit matrix expression XfY (X H Y) for the matrix P 
in P p.231-232] and [El Theorem 2.4, p.252]. 

Remark 5.3. The condition rank (Y) = rank ( K X H Y^j < p is necessary in 



Theorem 5.1. For example, let 



X 









), *h 


1 


D 








' H Y = ( 1 


i ) 


and 



and Y 



^YY = ( 1/2 1/2 ) H . Thus, 

) H , and s{T) = 1/2. On the other hand, 
we obtain that ta n Q( TZ(X), 7Z(Y)) = 0. From this example, we see that the 

does not hold for the case rank(F) = rank (X H Y) > p. 



T = XfY (X H Yy = ( 1/2 



result in Theorem 



5.1 



Corollary 5.1. Using the notation of Theorem 5.1 , let Y be full rank and 
p = q. Let P x ± be an orthogonal projection onto the subspace TZ(X±). Then 

tanQ(n(X),n(Y)) = S (p x ±Y (X H Yy 

Proof. Since the singular values are invariant under unitary transform, it 
is easy to obtain S (xf Y (X H Y)' 1 ^j , 0, . . . , = S (p x ±Y (X^F) -1 ). 

Moreover, the number of the singular values of P X ±Y (X H Y^ 1 is p, hence 
we obtain the result as stated. □ 
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Above we construct T explicitly in terms of bases of subspaces X and y. 
Next, we provide an alternative construction by adopting only the triangular 
matrix from the QR factorization of a basis of the subspace X + y. 

Corollary 5.2. Let X G C nxp and Y G <C nxq be matrices of full rank where 
q < p. Let the triangular part R of the reduced QR factorization of the matrix 
L = [X, Y] be 

Ru R12 
O R 22 



R 



where R12 is full rank. Then the positive singular values of the matrix T 
RwiRnV satisfy tan Q(n(X),1l(Y)) = [S+(T), 0, . . . , 0]. 



Proof. Let the reduced QR factorization of L be 
[ X Y ] = [ Q x Q 2 



Ru R12 
O R22 



where the columns of [ Qi Q2 ] are orthonormal, and TZ(Qi) = X. We 
directly obtain Y = Q1-R12 + Q2-R22- Multiplying by Qf on both sides of the 
above equality for Y, we get R X2 = Q\Y. Similarly, multiplying by Q% we 
have R22 = Q^,Y. Combining these equalities with Theorem 5.1 completes 
the proof. □ 



6. tan © in terms of the orthogonal projections onto subspaces 

In this section, we present T using orthogonal projectors only. Before we 
proceed, we need a lemma and a corollary. 

Lemma 6.1. [6] IfU and V are unitary matrices, then for any matrix A we 
have (UAVy = V H A^U H . 



Corollary 6.1. Let A G C nxn be a block matrix, such that 

A -- 



D 12 

O21 O22 



where D is a p x q matrix and O12, O21, and O22 are zero matrices. Then 

^12 ^22 
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Proof. Let U{n) denote a family of unitary matrices of size n by n. Let the 
SVD of D be UiYVf , where £ is a p x g diagonal matrix and £/i G 
Vi EU (g). Then there exist unitary matrices U 2 G W (n—p) and V2 G W (n—q), 
such that 

" " £ 12 

O21 O22 



.4 



C/ 2 



V{ 



H 



V* 



Thus, 



A' 



' Vi 






' s 


Oi 2 " 




v 2 




021 




' V\ 












v 2 




w 12 


w 22 



u» 



u 2 



Since = ViTtU?, we have A 



'12 



Of, E 



i2 
21 

22 



as desired. 



□ 



Theorem 6.2. Lei Py± and fre orthogonal projectors onto the sub- 
spaces X , X 1 - and y , correspondingly. Then the positive singular values 
S+(T) of the matrix T = P x ± {P x Py)' i satisfy 

tan Q(X,y) = [oo,...,oo,S+(T),0,...,0]. 

Proof. Let the matrices [ X Xj_ ] and [Y Y± ] be unitary, where 1Z(X) = 
X and 7Z{Y) = y. By direct calculation, we obtain 



X 
X 



PxPy[ Y Y ± 



X H Y O 

o o 



The matrix PxPy can be written as 
PxPy = [ X Xj_ 



' X H Y ' 




- yU - 







Yf _ 



From Lemma |6.1| and Corollary |6.1[ we have 

(P X PyY = [Y Y± ' 

In a similar way, 

P x ±Py = [X ± X 



(x H Yy o 
o o 



X H 
X" 



' XfY ' 











Yf _ 



(1) 



(2) 



(3) 
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Let T = P x ±P y {P X Py) ] and B = XfY (X H Y)\ so 



T 



[X ± X 
[X ± X 



XfY 











B ' 










(x H Yy o 
o o 



X H 

xY 



x H 

X? 



The singular values are invariant under unitary multiplications, therefore 
S + (T) = S + (B). 

Finally, we show that P x ±Py (PxPy)" 1 = Px± (PxPy)^ ■ The null space 
of PxPy is the orthogonal sum y 1 - © (y H X 1 ). The range of (PxPyY is 

thus y fl (y PI X 1 -^ , which is the orthogonal complement of the null space 
of PxPy- Moreover, (P x PyY is an oblique projector, because (PxPyY = 
((P x P y y) which can be obtained by direct calculation using equality (|2j). 
The oblique projector (PxPyY projects onto y along X ± . Thus, we have 
P y {PxPy) j = {PxPy) j ■ Consequently, we can simplify T = P x ±Py (PxPy) 
as T = P^x {P x Py) ] . □ 




Figure 2: Geometrical meaning of T = Px± (PxPy) ■ 

To gain geometrical insight of T, in Figure [2] we choose a generic unit 
vector z. We project z onto y along X ± , then project onto the subspace X 1 - 
which is interpreted as Tz. The red segment in the graph is the image of T 
under all unit vectors. It is straightforward to see that s(T) = \\T\\ = tan(#). 
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Using Property 2.2 and the fact that the principal angles are symmetric 
with respect to the subspaces X and y, t he e xpressions Pyx(PyP x y and 
P x (P x ±Py±Y can also be used in Theorem 6.2 



Remark 6.3. According to Remark 4^4_ an d Theorem 6.2 , we have 

P X {P x xPyx)^ = P X Pyx{P x xPyx)^ = - (p x x (P X Py)^ H . 

Furthermore, by the geometrical properties of the oblique projector [P x Pyy , 
it follows that (P x P y y = P y (P x P y y = (P x P y yP x . Therefore, we obtain 

P X ± {PxPy) ] = Px^Py {PxPy) ] = (Py ~ P x ){P x Py) ] ■ 
Similarly, we have 

Pyx{PyP x y = PyxP X (PyP x y = (P X - Py) (Py P X )^ = - (Py(P y xP x ^) H . 

From equalities above, we see that there are several different expressions for 
T in Theorem \6.2\ 

Theorem 6.4. For subspaces X and y, 

1. If dim(X) < dim(y), we have P x P y (P x P y y = P x (P x P y y = P x . 

2. If dim(X) > dim(y), we have {P x Py)^P x Py = (P x Py)^Py = Py. 

Proof. Using the fact that X H Y(X H Yy = I for dim(A') < dim(^) and 
combining identities ([TJ and ([2]), we obtain the first statement. Identities Q 
and @, together with the fact that {X H Y)^X H Y = I for dim(X) > dim(^), 
imply the second statement. □ 



Remark 6.5. From Theorem 6.4. for the case dim(X) < dim(^) we have 
P x x(P x Py)i = [P X Py) ] - P X {P X Py) ] = {P X Py) ] ~ P X . 



Since the angles are symmetric, using the second statement in Theorem \6. 4 
we have {P x xP y x)^P y = {P x xP y x)^ - P y x. 

On the other hand, for the case dim(A') > dim(^) we obtain 



(P X P y yPyX = (P X P y y - Py ^d{P X xPyx)^P X = {P X xPyX^ 



P 



To sum up, the following formulas for T in Table [£] can also be used in 
Theorem 6.2. An alternative proof for T = (P x Py) jf — Py is provided by 
Drmac in [H] for the particular case dim(X) = dim(y). 
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Tabic 2: Different formulas for T: left for dim(AT) < dini(^); right for dim(X) > dim(y). 





(P X P y y - Py 


(P^xPyx)' - PyX 


(p x ±p y ±y-p x ± 



Remark 6.6. Finally, we note that our choice of the space % = C™ may 
appear natural to the reader familiar with the matrix theory, but in fact is 
somewhat misleading. The principal angles ( and the corresponding principal 
vectors) between the subspaces X C T-L and y C H are exactly the same as 
those between the subspaces X C X + y and y C X + y, i.e., we can reduce 
the space T-L to the space X + y C T-L without changing PABS. 

This reduction changes the definition of the subspaces X 1 - and y L and, 
thus, of the matrices X± and Y± that column-span the subspaces X 1 - and 
y L . All our statements that use the subspaces X 1 - and y 1 - or the matrices 
X± and Y± therefore have their new analogs, if the space X + y substitutes 
for H. The formulas P x ± = I — P x ± and Py± = I — Py±, look the same, but 
the identity operators are different in the spaces X + y and T-L. 
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